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As a generation of the classical percolation, clique percolation focuses on the connection of cliques in a graph, 
where the connection of two fc-cliques means that they share at least I < k vertices. In this paper, we develop a 
theoretical approach to study clique percolation in Erdos-Renyi graphs, which gives not only the exact solutions 
of the critical point, but also the corresponding order parameter. Based on this, we prove theoretically that 
the fraction f of cliques in the giant clique cluster always makes a continuous phase transition as the classical 
percolation. However, the fraction (p of vertices in the giant clique cluster for / > 1 makes a step-function-like 
discontinuous phase transition in the thermodynamic limit and a continuous phase transition for I = \. More 
interesting, our analysis shows that at the critical point, the order parameter <pc for / > 1 is neither 0 nor 1, but a 
constant depending on k and 1. All these theoretical findings are in agreement with the simulation results, which 
give theoretical support and clarification for previous simulation studies of clique percolation. 
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I. INTRODUCTION 

Percolation theory has been widely used in the study of 
complex systems and complex networks[l]. In general, per¬ 
colation theory considers the emergence of the giant cluster 
formed by occupied vertices or edges in a graph, when pos¬ 
sible vertices or edges are occupied with a given probability 
0 < p < 1. The characteristics of percolation transitions in 
different graphs can give many useful results for the structural 
design and dynamic analysis of real network systems. For 
example, the percolation transition on graphs with degree dis¬ 
tribution pk ~ kf {2 < y < J) shows that such a system has a 
critical point Pc —> 0 for random vertex occupation and a very 
large pc for targeted vertex occupation (unoccupied large de¬ 
gree vertices with a high probability)[2-4]. This indicates that 
real network systems with such a degree distribution are ro¬ 
bustness against random failure, but extremely fragile against 
a targeted attack. 

In addition, some modified percolation models have been 
introduced into the study of complex networks, such as k-core 
percolation[5], bootstrap percolation[6], and percolation in in¬ 
dependent networks [7]. All these models play an important 
role in the theoretical studies of the cascade and diffuse pro¬ 
cesses on networks. At the same time, some new phenomena 
have also been found in the modified percolation models [8, 9], 
such as the discontinuous percolation transition[5-7]. 

To identify overlapping communities in large real networks, 
Derenyi and co-workers have proposed a dependent percola¬ 
tion model, called clique percolation[ 10, 11]. A clique is a full 
connected subset of vertices of a graph; such a subset with 
k vertices is often called a k-clique. In their model, two k- 
cliques are regarded as adjacent, if they share k - 1 vertices. 
As classical percolation in Erdos-Renyi (ER) graphs, k-clique 
percolation considers the behavior of clusters formed by con¬ 
nected k-cliques for a different connection probability p of 
edges. In network science, this approach has been proven suc¬ 
cessful in many applications for the analysis of network struc¬ 
ture, such as the study of cancer-related proteins in protein in¬ 


teraction networks[12], the analysis of stock correlations[13], 
and social networks[14]. 

As a percolation model, clique percolation itself provides 
a set of very interesting problems. The simulation results 
in ref.[l 1] indicate that the fraction of vertices in the largest 
k-clique cluster makes a discontinuous percolation transition 
with increasing connection probability p; however, the frac¬ 
tion of k-cliques in the largest k-clique cluster demonstrates 
a continuous percolation transition. Bollobas and Riordan 
give a rigorous mathematical result of the critical point for 
a more general percolation model, in which two k-cliques are 
regarded as adjacent if they share l{- 1,2, ...,k - 1) or more 
vertices[15]. The Monte Carlo simulation indicates that for 
different k and I, this general clique percolation model could 
demonstrate different critical phenomena[ 1 6] . Moreover, the 
percolation of weighted and directed cliques is also studied as 
a community finding method in real-world networks [17, 18]. 

In this paper we will present a theoretical study of the gen¬ 
eral clique percolation in ER graphs and give a theoretical 
support and clarification of previous studies. The paper is 
organized as follows. In the next section some distribution 
characteristics of cliques in ER graphs will be obtained. In 
Sec. Ill, we will solve the clique percolation model. In Sec. 
IV, we examine the finite size scaling at the critical point. In 
the last section we summarize our findings. 


II. CLIQUE DISTRIBUTION 

Eor the study of the percolation in highly clustered net¬ 
works, one usually needs to know the distribution character¬ 
istics of the clusters [19-21], so we begin our study with the 
clique distribution.A k-clique is composed by k vertices and 
( 2 ) edges. Eor ER graphs, each possible edge occurs indepen¬ 
dently with probability 0 < p < 1, so the expected number of 


A:-cliques in an ER graph with N vertices is 


2 





( 1 ) 


where gives the total number of cliques that can be formed 
in the graph. For — > oo, eq.(l) can be rewritten as 



k(k-l)/2 


( 2 ) 


On average, the number of A:-cliques containing a given vertex 
is 





pKk-l)/2^ 


(3) 


For simplicity, we define Zk as the average A:-clique degree of 
vertices. Moreover, the ^--clique degree distribution can be 
written as 


gkii) 








(4) 


where gives the maximum number of A:-cliques that 

could contain a given vertex. When —> oo, gf,(i) takes the 
form 

8k(i) = (5) 

i\ 

For k — 2, gk(i) is the degree distribution of ER graphs and 
Z 2 - Np is the average degree. 

Following an edge, there is only one vertex. However, fol¬ 
lowing an /-clique, we may find more than one k-clique or 
none (/ < k). It is readily known that in ER graphs, a k- 
clique can be built from an /-clique with probability ^( 2 )^( 2 ) = 
p(k-i)(,k+i-\)i 2 ^ Therefore, the probability distribution of the 
number of new k-cliques reached by following an /-clique of 
a k-clique can be written as 

,(;•) = yp(k-i){k+i-\)l2^ j'j _ ^(*-0(^+<-l)/2^ 

^ (6) 
where gives the total number of choices to build a new 
k-clique from an /-clique of a k-clique. When N 00 , 
also obeys the Poisson distribution 

fkj(j) = (7) 

j'- 

Here, = Y,jfk,iii)i = - 0! is the aver¬ 

age number of new k-cliques reached by following an /-clique 
of a k-clique. 


III. GENERAL FORMALISM 




FIG. 1. Sketches of the mapping from clique percolation to nomal 
percolation, (a) fe = 3 and / = 2. (b) fc = 3 and I - 1. (c) fe = 2 and 
/ = 1. Vertices of graph G' correspond to k-cliques of graph G and 
two vertices of graph G' are connected if the corresponding k-cliques 
of graph G share I or more vertices . Note that not all the vertices in 
graph G' can be connected directly when edges are added in graph 
G, such as vertices 5 and 7 of graph G' in (b). Therefore, graph G' 
can not be entirely considered an ER graph, and the two graphs are 
not equivalent even for k = 2 and / = 1. Specially, when graph G is a 
full connected graph, graph G' will have a special topology. 


vertices in graph G' are connected if the corresponding k- 
cliques of graph G share / vertices or more (see Fig.l). Ob¬ 
viously, the clique percolation in graph G corresponds to a 
normal percolation in graph G'. 

An /-clique {I < i < k) of a k-clique in graph G could 
map to more than one edge or none in graph G', which sat¬ 
isfies the distribution (7). Therefore, the degree distribution 
of graph G' is also a Poisson distribution with average de¬ 
gree z' = Zt'/ However, it is necessary to point out 

that graph G' can not be entirely considered an ER graph (see 
Fig.l). 

Furthermore, when the average degree z' takes a finite size, 
the leading order of z' must have size y^j ~ 0(1). This yields 
{^^yk,i has the order of 0 (aHt 7), which will vanish for 
large N. So when we consider the connection of graph G', the 
edges mapped from /-cliques with / > / can be ignored and the 
average degree of graph G' can be simplified as z' - [fjykj- 

Next we calculate the critical point based on the branch pro¬ 
cess of graph G'. As we know, if a giant cluster exists in 
graph G', the reproduction number of the branch process in 
such graph must be larger than one[22, 23]. That is, when 
we arrive at a vertex by following an edge, there must be ex¬ 
cess edges leading to other vertices. Considering graph G, a 
k-clique has (Jj - 1 excess /-cliques and each /-clique could 
map to yicj edges of graph G' on average. Thus, the critical 
point satisfies 



Let G be the ER graph we focus on. Then we consider a 
graph G', whose vertices are the k-cliques of graph G. The 


yk,i = 1 , 


(8) 
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that is 


Pc = , 


(9) 


where ^ 


{k-iy. 

0-1 


This result is the same as the one 


found in ref. [15]. When I - k - 1, pc reduces to the critical 
point [(^-l)A^]“'^^*"'Obtainedinrefs.[24, 25]. For^ = 2 and 
I - 1, Pc - I IN is the critical point of the classical percolation 
in ER graphs. In addition, for / < A: - 1, if the edges mapped 
from i-cliques with / > 1 are considered, a correction term 


) should be added at the critical point (9). 

The order parameter usually used in percolation theory is 
the fraction of vertices in the giant cluster; for clique perco¬ 
lation, it is the fraction of vertices in the giant clique clus¬ 
ter, labeled (p. As pointed out in ref.[l 1], there is another or¬ 
der parameter i/^ that can be used in clique percolation, which 
gives the fraction of cliques in the giant clique cluster. Note 
that (p can also be understood as the probability that a ran¬ 
domly chosen vertex belongs to the giant clique cluster and 
ip is the probability that a randomly chosen clique belongs to 
the giant clique cluster. To resolve the clique percolation, we 
need an auxiliary parameter (f, which expresses the probabil¬ 
ity that an /-clique of a /:-clique can lead to the giant clique 
cluster. As the branch process discussed above, the giant clus¬ 
ter in graph G' has tree-like structure near the critical point, 
so the connected cliques in graph G must also have tree-like 
structure[l 1, 15, 24]. 

Following an /-clique of a k-clique, if we can not find the gi¬ 
ant clique cluster, all the new k-cliques we find can not lead to 
the giant clique cluster. Thus, we can obtain a self-consistent 
equation for ip. 


( 10 ) 

J=o 


where (1 - ^)('^^* is the probability that all the other /-cliques 
of the k-clique reached by following the /-clique can not 
connect to the giant clique cluster. Substituting eq.(7) into 
eq.(lO), we have 


^ 1 _ 


= 1 -e 


ip/PcV 


(11) 


Solving this equation for the condition of ^ > 0, we will also 
find the critical point (9). 

Next we obtain the order parameters (p and ij/ by using ip. 
For a randomly chosen k-clique that does not belong to the 
giant clique cluster, all the /-cliques belonging to this k-clique 
can not lead to the giant clique cluster. Thus, ip satisfies 

1-(A = (1-^)®. (12) 


Together with eq.(ll), we find that the order parameter ip 
makes a continuous phase transition at the critical point (9). 

The simulation results of the order parameter >p for k = 3 
are shown in Fig. 2, which are in good agreement with the the¬ 
ory prediction of eqs.(l 1) and (12). One can find that the be¬ 
havior of the order parameter ip is much the same as that of the 
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FIG. 2. (Color online) Simulation results of the order parameter p for 
different graph sizes N. (a) k = 3 and / = 2. (b) k = 3 and / = 1. For 
both cases, the order parameter p demonstrates a continuous phase 
transition. The theory prediction is obtained from eqs.(l 1) and (12). 


classical percolation. This is because that the order parameter 
^/r just describes a normal percolation in the mapped graph G'. 

We now turn our attention to the order parameter p that is 
usually used in percolation theory. For a randomly chosen 
vertex that does not belong to the giant clique cluster, all the 
k-clique degrees of the vertex can not lead to the giant clique 
cluster. Thus, p satisfies 

j=0 

Note that the probability in the square brackets of eq.(13) 
should express the excess connection of the k-clique degree 
of the vertex, so the /-cliques containing the vertex are not 
counted. Substituting eq.(5) into eq.(13), we have 

Specifically, fork = 2 and / = 1, eqs.(ll) and (14) are just the 
equations for the classical percolation in ER graph[22]. 

For the discussion of the case N —> oo near the critical point, 
we rewrite eq.(14) as 

(15) 


where 


1 


(k-1)! 


(k-/)! 


- 1 


wiN) = N 


(A-llW-l) 

k-t-Z-l 


(16) 

(17) 


For / = 1, w{N) — 1 and then pc ~ (pc^ which indicates that 
p makes a continuous phase transition as <p. For I > \, pc ~ 
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FIG. 3. (Color online)Simulation results of the order parameter 0 for 
different graph sizes N. (a) ^ = 3 and / = 2. The phase transition of 
the order parameter (p becomes discontinuous in the thermodynamic 
limit. The inset shows the details close to the percolation threshold. 
{h)k = 3 and 1=1. The order parameter (p demonstrates a continuous 
phase transition. 



N 


FIG. 4. (Color online) Simulation results of the order parameters at 
the critical point. Here 4fc and 4’c scale as a power of graph size N. 
The corresponding theoretical results are as follows: (a) >]jc{k = 3,1 = 
1) ~ = 2) ~ and i/r,(fe = 4,/ = 3) ~ 

and (b) (pc{k = 3,1 = 1) ~ For I > I, (p, behaves as (p, = 

(pco + 0(N^‘) for finite size systems. 


w{N)<pc. This means that the critical behavior of (p depends on 
the relative sizes of w{N) and ip,. 

As shown in Fig. 3, the phase transition of (p for I = 1 
demonstrates a continuous phase transition, which is agree¬ 
ment with our theoretical prediction well when A — > oo. For 
I > I, the phase transition of the order parameter cp tends to be 
a step function for A —» oo with a nearly constant (p, for dif¬ 
ferent graph sizes A. An explanation will be given when we 
get the finite size scaling of p, in the next section. Otherwise, 
it should be noted that even though the model reduces to the 
classical percolation {k = 2,1 = 1), graphs G and G' are also 
different (see Fig. 1 (c)), so the order parameters i/r and cp are 
always inequivalent. 


IV. AT CRITICAL POINT 


It is known that, in ER graph, when p < Pc almost surely 
each cluster has size logN, at p = p,, the largest cluster has 
size of order A^^^, and at p > p,, there exists a single giant 
cluster of size of order A and all other clusters have size of or¬ 
der logN[23]. As discussed above, when a clique percolation 
occurs in graph G, there is a corresponding normal percola¬ 
tion in the mapped graph G'. Thus, the number of cliques in 
the giant clique cluster could also satisfy [11] 

C logC, p < Pc, 

C2/3, p = p„ (18) 

C, P> Pc. 


Using eqs.(l) and (9), we can find the total number of cliques 
at p = Pc, 


Cc ~ A‘+'-i. 


(19) 


Then, the giant clique cluster at the critical point C* scales 
as ^ Note that the giant clique cluster has tree¬ 

like structure at the critical point, which means the number 
of cliques in the giant clique cluster C* can not grow faster 
than the number of vertices. This yields 


C! 


N, 


2kl 


3(<:+/-l) 


' 2kl 


3(*:+/-l) 


< 1 
> 1 . 


( 20 ) 


Together with eq.(19), we have 



^-.W/3(A:+;-l)^ 

j^-(k-m-i)i(k+i-i) _ _!_ 

^ ' w(N) ’ 


2kl 


2(k+l-l) 

2kl 

3(k+l-V) - 


< 1 , 


> 1 . 


( 21 ) 


From eq.( 12), we find that the order parameter if/c ~ Pc, that 
is. Pc also satisfies eq. (21). Then, replacing p by the order of 
A in eq.(15), we have 


(0, P<Pc 

(p = \ l-e^, p^ Pc (22) 

(1, P>Pc. 

Here, k is a constant determined by k and 1. This means that 
the order parameter (p makes a special phase transition that 
when p < Pc, (p = 0; when p > p,, (p - I', and when p = pc, 
the order parameter is a constant 0 < (pc < 1. Furthermore, 


C* ~ 
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FIG. 5. (Color online) Simulation results of the clique cluster num¬ 
ber distribution at the critical point, where a clique cluster size is 
measured by the number of (a) cliques and (b) vertices in the cluster. 
For these plots, t = t' = 5/2 and 6 = 6'= 0.9 are used. 


when I = (pc ~ ipc i.e., <pc scales as the same power of N 
with ipc- 

In Fig.4, we give the finite size scaling at the critical point. 
In accord with our analysis, the order parameters (pc for I = 1 
and ipc are in agreement with eq.(21) and the order parameter 
(pc at the critical point for / > 1 is a constant for — > oo. For 
finite sizes, (pc for I > 1 behaves as (pc - (pco + (see 

Fig.4 (b)). 

In addition, for large k, the edges from the high-order term 
of yii.i (i > 0 will have a role in the percolation process of a 
finite size system, so the finite size scaling discussed above 
gives only approximate results for the case of large k and 
I < k-\. However, for/ = A:-1, all the results discussed above 
are accurate (see the case of k = A and / = 3 in Fig.4). Fur¬ 
thermore, another possible reason for the deviation between 
the theoretical results and the simulation results in Fig.4 is 
that graph G' can not be identical to an ER graph. 

To clarify the discontinuous percolation transition of (p for 
/ > 1, the clique cluster number distributions at the criti¬ 
cal point are shown in Fig. 5. The scaling ansatz are set as 
rjsipc) ~ s^'^fisjN^) and rjnipc) ~ rr'^'finjN^'). Here, s 
means the number of cliques in a clique cluster, and n the 
number of vertices. From Fig. 5, we find that the two distri¬ 
butions give the same critical exponents r = r' = 5/2 and 
6 — 6' 0.9. This indicates that there is no essential distinc¬ 

tion between the two percolation transitions measured by ip 
and (p. The different behaviors of the order parameters ^ and 
p are mainly caused by the quantitative relation between the 
total number of cliques and the total number of vertices. 


V. CONCLUSION 

In this paper, we have established three mean field equa¬ 
tions for clique percolation in random graph. Using these 
equations, we not only covered the critical points found in pre¬ 
vious studies, but also obtained the corresponding order pa¬ 
rameter in the thermodynamic limit. Based on this, we prove 
theoretically that the order parameter p for I > 1 makes a step- 
function-like phase transition in the thermodynamic limit and 
a continuous phase transition for / = 1. More interesting, our 
analysis showed that in the thermodynamic limit, the order pa¬ 
rameter p is neither 0 nor 1, but a constant depending on k and 
1. Through simulation we found that the order parameter p be¬ 
haves as pc - Pcq+0{N^^) at the critical point, which supports 
our theoretical study. In addition, our analysis also proved that 
the order parameter p always makes a continuous phase tran¬ 
sition as the classical percolation. Through analysis of the 
clique cluster number distribution, we found that there is no 
essential distinction between the two processed measured by 
p and p and the different behaviors are mainly caused by the 
quantitative relation between the total numbers of cliques and 
vertices. 

Although only ER graphs were studied in this paper, a sim¬ 
ilar discussion can be had for other networks if we know the 
distributions fk,i{j) and gk{i)- These results give a theoreti¬ 
cal clarification for the clique percolation, which will enhance 
the understanding of the emergence of community structures 
in complex networks and the clique percolation itself. 
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